modify split_qkv_rmsnorm_rope by Liwansi · Pull Request #282 · sgl-project/sgl-kernel-npu

Liwansi · 2025-12-26T02:04:38Z

Make the normalization optional to support Llama models.

* upstream/main:
  modify split_qkv_rmsnorm_rope (sgl-project#282)
  bump version to 2025.12.25 (sgl-project#281)
  l2 norm const parameter change (sgl-project#276)
  Fix the issue of HCCL buffer tiling verification failure during one round of testing. (sgl-project#280)

ZhongsJie · 2025-12-30T12:50:59Z

This PR may affect the current Qwen3 model support. sgl-project/sglang#12078.
Could you help confirm whether compatibility with that change has been considered in our current implementation?
Alternatively, is there a pending SGLang PR that still needs to be merged? @Liwansi

Liwansi · 2025-12-30T14:06:01Z

> This PR may affect the current Qwen3 model support. sgl-project/sglang#12078. Could you help confirm whether compatibility with that change has been considered in our current implementation? Alternatively, is there a pending SGLang PR that still needs to be merged? @Liwansi

Yes, I have considered this. The change introduced in this PR makes the normalization component of the split_qkv_rmsnorm_rope operator optional, thereby enabling support for Llama models. A relevant PR has already been submitted to SGLang and is awaiting merge.
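As a rough illustration of what "optional normalization" can mean for a fused split/norm/RoPE step, here is a minimal pure-Python reference sketch for a single token. It is not the kernel from this PR: the function name `split_qkv_rmsnorm_rope_ref`, its parameters, the per-head RMSNorm over `head_dim`, and the rotate-half RoPE layout are all assumptions made for the example. Passing `None` for a weight skips the norm for that projection (the Llama-style path), while passing a weight applies it (the Qwen3-style path).

```python
import math

def rms_norm(head, weight, eps=1e-6):
    """RMS-normalize one head vector, then scale element-wise by weight."""
    inv_rms = 1.0 / math.sqrt(sum(x * x for x in head) / len(head) + eps)
    return [x * inv_rms * w for x, w in zip(head, weight)]

def rope(head, cos, sin):
    """Rotate-half rotary embedding for one head vector (assumed layout)."""
    half = len(head) // 2
    rotated = [-x for x in head[half:]] + head[:half]
    return [x * c + r * s for x, r, c, s in zip(head, rotated, cos, sin)]

def split_qkv_rmsnorm_rope_ref(qkv, cos, sin, num_q_heads, num_kv_heads,
                               head_dim, q_weight=None, k_weight=None,
                               eps=1e-6):
    """Hypothetical reference for one token: split, optional norm, RoPE.

    qkv      : flat list of length (num_q_heads + 2 * num_kv_heads) * head_dim
    cos, sin : lists of length head_dim for this token's position
    q_weight, k_weight : RMSNorm weights of length head_dim, or None to skip
    """
    q_size = num_q_heads * head_dim
    kv_size = num_kv_heads * head_dim
    q = qkv[:q_size]
    k = qkv[q_size:q_size + kv_size]
    v = qkv[q_size + kv_size:]

    def process(flat, weight):
        out = []
        for start in range(0, len(flat), head_dim):
            head = flat[start:start + head_dim]
            if weight is not None:  # normalization is optional
                head = rms_norm(head, weight, eps)
            out.extend(rope(head, cos, sin))
        return out

    # V is only split; it gets neither normalization nor RoPE.
    return process(q, q_weight), process(k, k_weight), v
```

In this sketch a caller that needs Q/K normalization passes both weights, and a caller that does not simply omits them, so one entry point can serve both model families.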

ZhongsJie · 2025-12-31T01:06:09Z

@Liwansi Great! Could you please share the link to the related PR?

…pu-old into bugfix

* 'a3_topk-1' of https://github.com/luanyundu/sgl-kernel-npu-old:
  fix dispatch_layout to support topk -1 feature
  optimize gdn gating and fused_qkvzba_split_reshape_cat (sgl-project#306)
  fix layout numTokensPerExpertTensor partial Initialization bug (sgl-project#303)
  Supplement A2 doc, software and hardware compatibility info (sgl-project#294)
  Added an environment variable to control whether to enable the Combine Ant Migration feature. (sgl-project#304)
  Support build with cann 8.5 (sgl-project#283)
  LoRA: Optimization LoRA kernels and refactoring (sgl-project#284)
  fix a2 single combine aclnn params
  Resolving the UB out-of-bounds issue caused by A2 dual-machine mixed operation (sgl-project#288)
  fix notify magic auto-increment bug (sgl-project#291)
  split_qkv_rmsnorm_rope bugfix (sgl-project#290)
  Optimize prepare_lens by removing device transfer (sgl-project#289)
  Fix the performance degradation issue of the single-wheel operation in Ant Moving. (sgl-project#287)
  modify split_qkv_rmsnorm_rope (sgl-project#282)

modify split_qkv_rmsnorm_rope

d3824f2

Liwansi force-pushed the main_1226 branch from b47a6b3 to d3824f2 Compare December 26, 2025 05:42

iforgetmyname approved these changes Dec 26, 2025

iforgetmyname merged commit c7fcd82 into sgl-project:main Dec 26, 2025
2 of 4 checks passed

AndyKong2020 pushed a commit to AndyKong2020/sgl-kernel-npu that referenced this pull request Mar 24, 2026

modify split_qkv_rmsnorm_rope (sgl-project#282)

4488090
